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ABSTRACT 

Background:  Anatomic  measures  of  injury  burden  pro¬ 
vide  key  information  for  studies  of  prehospital  and 
in-hospital  trauma  care.  The  military  version  of  the  Ab¬ 
breviated  Injury  Scale  [AIS(M)]  is  used  to  score  injuries 
in  deployed  military  hospitals.  Estimates  of  total  trauma 
burden  are  derived  from  this.  These  scores  are  used  for 
categorization  of  patients,  assessment  of  care  quality, 
and  research  studies.  Scoring  is  normally  performed  ret¬ 
rospectively  from  chart  review.  We  compared  data  re¬ 
corded  in  the  UK  Joint  Theatre  Trauma  Registry  (JTTR) 
and  scores  calculated  independently  at  the  time  of  sur¬ 
gery  by  the  operating  surgeons  to  assess  the  concordance 
between  surgeons  and  trauma  nurse  coordinators  in  as¬ 
signing  injury  severity  scores.  Methods:  Trauma  casual¬ 
ties  treated  at  a  deployed  Role  3  hospital  were  assigned 
AIS(M)  scores  by  surgeons  between  24  September  2012 
and  16  October  2012.  JTTR  records  from  the  same  pe¬ 
riod  were  retrieved.  The  AIS(M),  Injury  Severity  Score 
(ISS),  and  New  Injury  Severity  Score  (NISS)  were  com¬ 
pared  between  datasets.  Results:  Among  32  matched 
casualties,  214  injuries  were  recorded  in  the  JTTR, 
whereas  surgeons  noted  212.  Percentage  agreement  for 
number  of  injuries  was  19%.  Surgeons  scored  75  injuries 
as  “serious”  or  greater  compared  with  68  in  the  JTTR. 
Percentage  agreement  for  the  maximum  AIS(M),  ISS, 
and  NISS  assigned  to  cases  was  66%,  34%,  and  28%, 
respectively,  although  the  distributions  of  scores  were 
not  statistically  different  (median  ISS:  surgeons:  20  [in¬ 
terquartile  range  (IQR),  9-28]  versus  JTTR:  17.5  [IQR, 
9-31.5],  p  =  .7;  median  NISS:  surgeons:  27  [IQR,  12- 
42]  versus  JTTR:  25.5  [IQR,  11.5-41],  p  =  .7).  Conclu¬ 
sion:  There  are  discrepancies  in  the  recording  of  AIS(M) 
between  surgeons  directly  involved  in  the  care  of  trauma 
casualties  and  trauma  nurse  coordinators  working  by 
retrospective  chart  review.  Increased  accuracy  might  be 
achieved  by  actively  collaborating  in  this  process. 

Keywords:  Injury  Severity  Score;  Abbreviated  Injury  Scale ; 
trauma;  surgeon;  trauma  nurse  coordinator 


Introduction 

Coding  is  a  key  process  in  trauma  systems,  providing 
a  standardized  record  of  individual  injuries  and  a  ba¬ 
sis  for  calculation  of  estimates  of  overall  injury  burden. 
Such  scores  are  commonly  reported  as  demographic  pa¬ 
rameters  in  the  trauma  literature  and  are  used  to  assess 
the  performance  and  governance  of  care  within  and  be¬ 
tween  trauma  systems.1,2  Accuracy  and  reproducibility 
in  scoring,  therefore,  are  essential. 

There  are  four  widely  used  scoring  systems  of  importance 
in  the  assessment  of  the  burden  of  traumatic  injuries:  (1) 
the  Abbreviated  Injury  Scale  (AIS)3,4;  (2)  a  military  ad¬ 
aptation  of  the  AIS  [AIS(M)],5  which  has  been  reported 
to  predict  mortality  for  casualties  injured  on  military 
operations  more  accurately  than  the  civilian  version6; 
(3)  the  Injury  Severity  Score  (ISS)7;  and  (4)  the  New  In¬ 
jury  Severity  Score  (NISS).8  These  scores  are  calculated 
using  objective  and  consistent  measures.  The  first  two 
systems  are  based  on  the  assignment  of  injury  severity 
to  individual  injuries  within  defined  anatomic  regions 
by  matching  the  injury  against  detailed  descriptions 
within  the  relevant  dictionary;  the  remaining  systems  are 
derivatives  of  these.  Illustrative  injuries  for  the  various 
AIS(M)  grades  are  shown  in  Table  1,  and  Figure  1  shows 


Table  1  Examples  of  Injuries  Assigned  Various  Abbreviated 
Injury  Scale  (Military  Version )  Scores 


Score 

Severity 

Example 

1 

Minor 

Isolated  rib  fracture 

2 

Moderate 

Testicular  avulsion 

3 

Serious 

Simple  hemothorax 

4 

Severe 

Below-knee  traumatic 
amputation 

5 

Critical 

Femoral  artery  injury  with 
>20%  blood  loss 

6 

Maximum 

Torso  transsection 
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Figure  1  Injury  scoring  processes. 


Note:  AIS(M),  Abbreviated  Injury  Score  (Military  version);  ISS,  Injury 
Severity  Score;  NISS,  New  Injury  Severity  Score. 


the  relationship  between  these  scoring  systems.  NISS  has 
been  shown  to  be  a  more  accurate  predictor  of  mortal¬ 
ity  and  morbidity  in  various  clinical  settings,9-13  and  it  is 
suggested  that  NISS  reflects  military  injury  burden  more 
accurately  than  ISS,14  especially  if  derived  from  AIS(M). 

Previous  studies  have  demonstrated  large  variation  in 
the  number  of  injuries  identified  per  case15  in  civilian 
trauma.  Compared  with  a  reference  set  of  cases,  up 
to  31%  of  injuries  may  be  missed  by  trained  coders.16 
Agreement  for  specific  codes  assigned  to  individual  inju¬ 
ries  may  be  as  low  as  3 9%. 16,17  This  variation  may  lead 
to  limited  reproducibility  of  estimation  of  overall  injury 
burden,  with  one  study  having  found  the  probability  of 
any  two  raters  assigning  the  same  ISS  being  as  low  as 
28%  (rising  to  only  51%  agreement  if  cases  are  placed 
in  bands  of  severity).18  However,  variation  in  coding  of 
specific  injuries  may  not  compromise  the  reproducibil¬ 
ity  of  overall  measures  of  trauma  burden:  in  a  study  of 
six  trained  raters,  despite  only  39%  agreement  for  al¬ 
located  AIS  codes,  the  intraclass  correlation  coefficient 
(ICC)  for  ISS  was  “almost  perfect  (ICC  0.90). ”17  In  con¬ 
trast,  a  study  of  10  raters  found  that  the  limits  of  agree¬ 
ment  for  every  pairing  of  raters  exceeded  “clinically 
acceptable”  bounds  (defined  by  the  authors  as  ±9  ISS 
or  NISS  units).16  Despite  these  limitations,  ISS  remains 
the  most  commonly  reported  measure  of  injury  sever¬ 
ity  in  the  trauma  literature,  and  AIS(M),  ISS,  and  NISS 
are  routinely  recorded  in  the  JTTR.  Ultimately,  all  three 
measures  depend  on  correct  identification  and  coding 
of  individual  injuries.  The  perspective  of  a  surgeon  who 
has  seen  an  injury  firsthand  and  that  of  a  rater  scoring 
from  chart  review  may  differ.  This  could  lead  to  differ¬ 
ent  scores  being  assigned  to  the  same  injury. 

The  aim  of  this  study  was  to  compare  AIS(M)  scores  and 
derivatives  assigned  contemporaneously  by  the  treat¬ 
ing  clinicians  at  a  UK  deployed  military  medical  facility 
against  entries  for  the  same  trauma  casualties  recorded  in 
the  JTTR.  Based  on  anecdotal  and  personal  experience  of 
working  with  JTTR  data,  we  hypothesized  that  scoring  by 


physicians  with  clinical  responsibility  for  patients  would 
differ  from  JTTR  entries  resulting  from  chart  review. 

Methods 

Setting 

The  Joint  Force  Medical  Group  Hospital  at  Camp  Bas¬ 
tion  was  a  UK-led,  coalition  medical  facility  in  Hel- 
mand  Province  in  southern  Afghanistan,  which  provided 
medical  support  to  counterinsurgency  operations  by  the 
International  Security  Assistance  Force.  Military  wound¬ 
ing  patterns  dominated  admissions  to  this  facility,  with 
75%  of  admissions  with  NISS  >16  injured  by  explosive 
mechanisms  and  23%  by  gunshot;  2%  were  injured  by 
the  blunt  trauma  more  typical  of  civilian  injuries.19 

The  UK  JTTR 

The  UK  JTTR  is  a  prospectively  gathered  database  of 
all  patients  (regardless  of  affiliation)  who  trigger  activa¬ 
tion  of  the  trauma  team  at  a  deployed  UK  field  hospital 
and  of  all  UK  Service  personnel  evacuated  to  the  UK.20 
Trauma  nurse  coordinators  (TNCs)  record  and  code  in¬ 
juries  after  treatment  at  the  deployed  facility,  based  on 
chart  review  and  imaging.  ISS  and  NISS  are  calculated 
from  AIS(M). 

Study  Design 

In  this  preliminary,  prospective,  observational  study, 
physicians  coded  a  patient’s  injuries  immediately  after 
treating  them.  Rather  than  investigating  possible  differ¬ 
ences  in  medical  and  nursing  interpretation  of  written 
patient  records,  this  design  allowed  the  authors  to  in¬ 
vestigate  whether  the  definitive  JTTR  record  accurately 
recorded  the  injuries  identified  by  the  physicians. 

Selection  Criteria 

Patients  were  included  in  the  study  if  they  triggered 
trauma  team  activation  (any  casualty  assigned  the  high¬ 
est  triage  category  or  for  whom  team  activation  was  ini¬ 
tiated  by  any  member  of  staff).  Cases  were  excluded  if 
they  had  not  suffered  injury  (e.g.,  a  patient  allocated  a 
high  triage  category  due  to  a  medical  emergency). 

Registration  and  Approval 

This  study  was  authorized  by  and  registered  with  the 
Royal  Centre  for  Defence  Medicine  (RCDM/Res/au- 
dit/1036/12/0264)  and  with  the  US  Army  Joint  Combat 
Casualty  Research  Team  as  a  performance  improvement 
project  for  which  institutional  review  board  approval 
was  not  required.  We  adhered  to  the  Guidelines  for  Re¬ 
porting  Reliability  and  Agreement  Studies.21 

Statistics 

The  ISS  and  NISS  generated  by  each  method  were  found 
to  be  asymmetrically  distributed  around  their  medians 
by  using  the  Shapiro-Wilk  test.  Therefore,  medians  were 
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compared  by  using  the  sign  test.  Reliability  analysis  was 
performed  by  grouping  the  scores  according  to  reviewer 
type  (clinician  versus  TNC)  and  assuming  each  group 
consisted  of  scores  by  a  single  reviewer.  Interrater  agree¬ 
ment  was  examined  with  the  Bland-Altman  limits  of 
agreement  method,  with  reproducibility  defined  as  95% 
of  differences  lying  within  2  standard  deviations  (SDs)  of 
the  mean.22 

Interrater  agreement  and  reliability  were  further  exam¬ 
ined  by  using  weighted  k  and  ICC  statistics,  respectively. 
Weighted  k  statistics  treat  the  assigned  scores  as  ordinal 
data  and  provide  an  index  of  agreement  between  the 
two  raters  (clinicians  and  TNCs).  The  k  estimates  were 
based  on  squared  weights:  the  squared  distance  from 
perfect  agreement  determined  the  weight  assigned  to 
any  disagreement  between  raters.23 

To  examine  consistency  rather  than  absolute  agreement, 
and  for  comparison  with  similar  studies,16,17  interrater 
reliability  of  maximum  AIS(M),  ISS,  and  NISS  was  as¬ 
sessed  by  using  ICC  estimates.  We  assumed  cases  were 
drawn  from  a  larger  pool  of  casualties  but  that  raters 
were  fixed,  and  thus  estimated  single-measures  ICC 
using  a  one-way  random-effects  model.  ICC  estimates 
were  reported  with  95%  confidence  intervals  (CIs).  Our 
predetermined  Cl  width  was  95%,  in  keeping  with  com¬ 
mon  practice16  and  for  comparison  with  other  published 
results.  We  used  the  Cl  to  reflect  the  level  of  uncertainty 
associated  with  the  estimates,  not  to  imply  statistical  sig¬ 
nificance;  our  study  was  not  powered  to  do  so. 

Both  weighted  k  and  ICC  were  interpreted  by  using  the 
arbitrary  method  of  Landis  and  Koch.24  Statistics  were 
calculated  using  R,  version  3.10  (R  Foundation  for  Sta¬ 
tistical  Computing;  www.r-project.org)  with  the  Meth- 
Comp25  and  irr26  libraries. 

Data  Collection 

During  a  3-week  period  from  24  September  2012  to  16 
October  2012,  attending  surgeons  who  operated  on  these 
patients  were  provided  with  a  copy  of  the  AIS(M)  diction¬ 
ary  and  asked  to  code  each  injury  with  which  they  had 
been  involved.  In  cases  where  the  patient  underwent  sur¬ 
gery,  this  coding  took  place  as  soon  after  the  initial  opera¬ 
tion  as  was  practical.  ISS  and  NISS  were  calculated  from 
these  codes.  In  parallel  (and  independent  of  the  study  in¬ 
vestigators),  JTTR  data  collection  continued  as  normal, 
undertaken  by  TNCs.  JTTR  scores  were  retrieved  via  a 
standard  request  for  release  of  data  analyzed  after  data 
collection  was  complete.  The  following  data  were  identi¬ 
fied  from  each  source:  AIS(M)  for  each  injury,  number  of 
injuries  in  each  body  region  (by  AIS  definition),  highest 
AIS(M)  in  each  body  region,  number  of  injuries  for  each 
severity  grade,  total  number  of  injuries,  ISS,  and  NISS.  All 
matched  cases  were  included  in  analysis. 


Results 

Data  were  collected  by  study  investigators  for  32  pa¬ 
tients  who  received  trauma-team  activation  during 
this  period.  Patient  characteristics  are  shown  in  Table 
2.  The  JTTR  recorded  “motor  vehicle  collision”  as  the 
mechanism  of  injury  in  three  patients;  however,  two  of 
these  were  unequivocally  casualties  from  motor  vehicles 
struck  by  improvised  explosive  devices  (IEDs).  Four  ad¬ 
ditional  records  of  “major  trauma”  casualties  (ISS  >16) 
were  found  in  the  JTTR  for  this  period.  Although  these 
would  have  been  expected  to  meet  inclusion  criteria, 
they  had  not  been  identified  by  the  investigators.  Conse¬ 
quently,  no  ratings  by  clinical  staff  had  been  performed, 
and  they  could  not  be  included  in  analysis. 


Table  2  Fatient  Characteristics 


Characteristic 

Data 

Male  sex,  no.  (%) 

30  (94) 

Age,  median  (IQR)  [range],  y 

22  (21-27)  [3-42] 

Affiliation,  no.  (%) 

Afghan  Security  Forces 

15  (47) 

ISAF 

11  (34) 

Civilian 

6(19) 

Mechanism  of  injury,  no.  (%) 

Explosion 

19  (59) 

Gunshot 

10  (31) 

Burns 

2(6) 

Motor  vehicle  collision 

1(3) 

Note :  IQR,  interquartile  range;  ISAF,  International  Security  Assistance 
Force 


The  median  ISS  recorded  by  investigators  was  20  (inter¬ 
quartile  range  [IQR],  9-28)  compared  with  17.5  (IQR, 
9-31.5)  for  the  JTTR  (p  =  .7).  Investigators  recorded  a 
median  NISS  of  27  (IQR,  12-42)  while  the  JTTR  data 
had  a  median  of  25.5  (IQR,  11.5-41;  p  =  .7).  The  32 
casualties  had  sustained  214  injuries  as  recorded  in  the 
JTTR,  whereas  the  study  investigators  noted  212  inju¬ 
ries.  The  percentage  agreement  for  number  of  injuries 
recorded  was  19%,  with  a  difference  of  up  to  14  inju¬ 
ries  per  anatomic  region  for  the  entire  cohort  (Table  3). 
Injuries  were  recorded  for  98  separate  body  regions  in 
at  least  one  of  the  datasets.  Percentage  agreement  for 
the  highest  regional  AIS(M)  was  51%.  Study  investiga¬ 
tors  recorded  75  injuries  scored  as  “serious”  or  greater 
[AIS(M)  >3],  whereas  the  JTTR  had  68  such  injuries  re¬ 
corded.  Percentage  agreement  for  the  maximum  AIS(M) 
assigned  to  cases  [maxAIS(M)]  was  66%  (Table  4),  with 
disagreement  by  one  AIS(M)  grade  in  10  of  11  cases 
(91%).  The  remaining  case  differed  by  two  grades. 

Percentage  agreements  for  identical  ISS  and  NISS  were 
34%  and  28%,  respectively.  Reproducibility  criteria  were 
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Table  3  Total  Injuries  Recorded  in  Each  Anatomic  Region 
Compared  Between  Joint  Theatre  Trauma  Registry  (JTTR) 
and  Study  Investigators 


AIS  Region 

Injuries  Recorded 
According  to 
JTTR,  No. 

Injuries  Recorded 
by  Investigators, 
No. 

Head 

25 

11 

Face 

23 

14 

Neck 

0 

1 

Thorax 

19 

16 

Abdomen 

29 

37 

Spine 

9 

4 

Upper  extremity 

29 

35 

Lower  extremity 

75 

81 

External 

5 

13 

Other 

0 

0 

Note:  AIS,  Abbreviated  Injury  Scale. 


not  met,  with  differences  in  scores  lying  outside  2  SDs  of 
the  mean  difference  in  two  of  32  cases  (6.25%)  (Figure 
2  and  3).  However,  there  was  no  evidence  of  a  system¬ 
atic  difference  in  assigned  ISS  and  NISS  values  within 
our  sample. 

Weighted  k  indexes  of  interrater  agreement  of  0.88  ( p 
<  .001)  “almost  perfect,”  0.41  (p  =  .02)  “moderate,” 
and  0.63  (p  <  .001)  “substantial”  were  obtained  for 
maxAIS(M),  ISS,  and  NISS,  respectively.  ICC  indexes  of 
consistency  in  single  scores  were  “almost  perfect,”  with 
ICCs  of  0.89  (95%  Cl,  0.78-0.94),  0.94  (95%  Cl,  0.88- 
0.97),  and  0.84  (95%  Cl,  0.70-0.92)  for  maxAIS(M), 
ISS,  and  NISS,  respectively.  At  the  lower  bounds  of  the 
Cl,  the  reliability  for  maxAIS(M)  and  NISS  remained 
“substantial”  (ICC,  0.61-0.8024).  Summary  statistics  for 


Figure  2  Reproducibility  of  Injury  Severity  Score  (ISS) 
between  Joint  Theatre  Trauma  Registry  (JTTR)  and 
investigators. 


Figure  3  Reproducibility  of  New  Injury  Severity  Score 
(NISS)  between  Joint  Theatre  Trauma  Registry  (JTTR) 
and  investigators. 
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injury  scores  by  reviewer,  weighted  k,  and  ICC  statistics 
are  given  in  Table  5.  The  distributions  of  comparative 
ISS  and  NISS  scores  are  shown  in 
Figures  4  and  5. 


Discussion 

This  is  the  first  study  to  assess 
interobserver  variability  of  in¬ 
jury  severity  scoring  in  a  modern 
combat  hospital.  The  main  find¬ 
ing  of  this  study  is  that,  during 
the  period  of  observation,  there 
were  discrepancies  in  the  AIS(M) 
recorded  for  the  same  patients 
between  TNCs  and  treating  clini¬ 
cians.  In  addition  to  disagreement 
in  injury  scores,  there  were  further 
differences  in  the  total  number  of 
injuries  and  in  both  regional  and 
total  injury  burdens.  However, 
this  should  not  be  interpreted  as  a 


Table  4  Percentage  Agreement  and  Interobserver  Reliability  of  Maximum  Regional 
Abbreviated  Injury  Scale  (Military  Version) 


Region 

Agreement, 

% 

ICC  (95%  Cl) 

Degree  of  Agreement  at 
Lower  Cl  Extreme 

Head 

87.5 

0.908 

(0.902-0.976) 

Almost  perfect 

Face 

81.3 

0.746 

(0.544-0.867) 

Moderate 

Neck 

96.9 

0.000 

(-0.340-0.342) 

None 

Thorax 

84.4 

0.953 

(0.906-0.977) 

Almost  perfect 

Abdomen 

75.0 

0.881 

(0.772-0.940) 

Substantial 

Spine 

93.8 

0.579 

(0.297-0.769) 

Fair 

Upper 

extremity 

81.3 

0.883 

(0.775-0.941) 

Substantial 

Lower 

extremity 

68.8 

0.767 

(0.578-0.879) 

Moderate 

External 

81.3 

0.844 

(0.707-0.921) 

Substantial 

Note :  Cl,  confidence  interval;  ICC,  intraclass  correlation  coefficient. 
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Table  5  Summary  Statistics  for  Injury  Severity  Scores  by  Reviewer  Group* 


Score 

Reviewer 

Mean 

Median 

SD 

Weighted  k 

TCC 

(95%  Cl) 

MaxAIS(M) 

Surgeon 

3.53 

4 

1.39 

o 

oo 

oo 

57 

A 

b 

o 

i-^ 

0.89 

(0.78-0.94) 

TNC 

3.53 

4 

1.39 

ISS 

Surgeon 

22.4 

17 

18.5 

0.63  (p  <  .001) 

0.94 

(0.88-0.97) 

TNC 

22.5 

20 

18.2 

NISS 

Surgeon 

29.2 

25.5 

21.7 

0.41 

(p  =  .020) 

0.84 

(0.70-0.92) 

TNC 

30.6 

27.0 

20.9 

Note:  *  Interrater  agreement  and  reliability  for  each  score  given  by  weighted  k  and  ICC,  respec¬ 
tively.  Cl,  confidence  interval;  ICC,  intraclass  correlation  coefficient;  ISS,  Injury  Severity  Score; 
MaxAIS(M),  maximum  Abbreviated  Injury  Scale  (Military  version);  NISS,  New  Injury  Severity 
Score;  SD,  standard  deviation;  TNC,  trauma  nurse  coordinator. 

Figure  4  Comparison  of  Injury  Severity  Score  (ISS)  between 
Joint  Theatre  Trauma  Registry  (JTTR)  and  investigators. 
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Figure  5  Comparison  of  New  Injury  Severity  Score  (NISS) 
between  Joint  Theatre  Trauma  Registry  (JTTR)  and 
investigators. 


major  blood  loss).  No  investiga¬ 
tion  of  the  ability  of  surgeons  or 
TNCs  to  estimate  and  attribute 
traumatic  blood  loss  was  identi¬ 
fied  from  the  literature. 

When  comparing  anatomic  re¬ 
gions,  the  study  investigators 
tended  to  miss  injuries  to  the 
head  and  face.  This  may  relate 
to  surgeons  scoring  shortly  af¬ 
ter  initial  damage  control  op¬ 
erations,  where  attention  had 
been  focused  on  torso  and  limb 
injuries  with  only  a  provisional 
radiographic  diagnosis  of  head  and  facial  injuries  avail¬ 
able.  In  contrast,  the  definitive  radiology  report  was 
available  for  JTTR  coding.  Surgeons  tended  to  record 
more  torso,  limb,  and  external  injuries  than  TNCs.  This 
may  be  an  effect  of  direct  involvement  with  casualties 
(and  would  be  consistent  with  previous  reports  that 
surgeon  involvement  improves  data  quality),27  whereas 
conventional  coding  relies  on  the  level  of  detail  recorded 
in  written  records  and  the  ability  to  match  that  detail  to 
the  specific  descriptors  within  the  relevant  AIS  diction¬ 
ary.  Such  discrepancies  between  individuals  when  calcu¬ 
lating  injury  severity  scores  are  not  unique;  variation  in 
calculation  of  ISS  of  up  to  80%  from  the  mean  has  been 
noted.28  In  a  large  study  from  Queensland,  Australia,  six 
raters  independently  coding  120  sets  of  notes  achieved 
almost  perfect  agreement  for  ISS.17  However,  that  vali¬ 
dation  exercise  was  based  on  repeated  examination  of 
standardized  data  rather  than  the  comparison  of  visual¬ 
ized  injuries  against  written  records. 


Despite  the  differences  in  this  study  between  investiga¬ 
tor  and  JTTR  AIS(M)  scores,  there  were  no  statistically 
significant  differences  between  the  groups  in  the  derived 
ISS  and  NISS  for  these  casualties.  This  may  suggest  that 
discrepancies  in  AIS(M)  do  not  affect  the  predictive  and 
prognostic  function  of  the  JTTR  data.  This  is  consistent 
with  a  previous  report  of  “almost  perfect”  interrater 
reliability  despite  only  36%  agreement  regarding  AIS.17 
However,  this  current  study  is  small;  statistically  signifi¬ 
cant  differences  might  be  identified  if  a  larger  cohort  of 
patients  were  examined  as  part  of  a  study  design  that 
removed  the  potential  for  confounding  discussed  in  the 
limitations  section. 


comment  on  the  relative  abilities  of  TNCs  and  surgeons 
to  score  injuries  accurately.  Rather,  our  results  likely  rep¬ 
resent  a  difference  in  the  perception  of  an  injury  visualized 
at  the  time  of  surgery  compared  with  its  written  record 
in  medical  documentation.  This  may  lead  to  differences 
in  both  the  description  of  an  injury  and  the  interpreta¬ 
tion  of  secondary  features  required  for  accurate  coding 
(e.g.,  the  identification  of  specific  injuries  responsible  for 


Our  suggestion  that  the  differences  in  assigned  scores 
arise  from  different  exposures  to  the  casualty  is  consis¬ 
tent  with  previous  findings  that  involving  trauma  sur¬ 
geons  in  coding  at  a  Level  I  trauma  center  resulted  in 
coding  amendments  in  5.2%  of  cases  scored  by  non¬ 
surgeon  coders  alone,  with  71%  of  these  amendments 
being  an  upward  revision  of  ISS.27  If  accurate  scoring 
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in  the  military  context  is  to  be  achieved,  then  active  in¬ 
volvement  of  the  surgeon  will  be  essential.  The  authors 
consider  it  vital  that  deployed  TNCs  feel  able  to  seek 
clarification  from  the  operating  surgeon,  while  surgeons 
could  contribute  to  improvement  by  describing  injuries 
in  language  consistent  with  the  dictionary  when  writing 
the  operative  record.  Together,  the  TNC  and  the  sur¬ 
geon  should  generate  a  consensus  injury  description  to 
describe  the  injury  in  terms  of  the  most  accurate  “fit” 
in  the  scoring  dictionary.  Existing  collective  predeploy¬ 
ment  training  provides  an  ideal  opportunity  to  estab¬ 
lish  collaboration  by  the  inclusion  of  suitable  casualty 
scenarios.29 

A  further  potential  contributor  to  our  findings  is  that 
UK  TNCs  are  trained  for  specific  operations  but  rarely 
have  significant  exposure  to  trauma  data  capture  before 
deployment  and  are  unlikely  to  return  subsequently  to 
that  role.  Detailed  training  in  anatomy,  radiology  report 
writing,  and  injury  description  is  required  to  undertake 
the  TNC  role  effectively.  Establishing  a  Defence  Medi¬ 
cal  Services  TNC  cadre,  working  within  the  civilian  UK 
Major  Trauma  Networks,  would  allow  participation  in 
the  ongoing  UK  Trauma  Audit  &  Research  Network 
(TARN)  activity,  leading  to  increased  expertise  in  the 
matching  AIS(M)  descriptions  to  computed  tomography 
scan  reports,  surgical  operation  notes,  and  postmortem 
studies.  While  civilian  and  military  wounding  patterns 
differ,  such  exposure  would  enable  TNCs  to  deploy  with 
greater  experience  in  trauma  audit  and  governance. 

A  secondary  finding  of  this  study  is  that  the  JTTR  may 
not  accurately  record  mechanism  of  injury  data  when  an 
IED  strike  to  a  military  vehicle  precipitates  a  rollover  or 
other  accident  in  which  patients  sustain  injuries  typical 
of  a  more  conventional  motor  vehicle  incident.  Finally, 
the  failure  of  investigators  to  identify  and  record  all  eli¬ 
gible  patients  almost  certainly  relates  to  their  primary 
responsibility  of  providing  surgical  care  to  the  wounded. 
This  reinforces  the  importance  of  the  deployed  hospital 
establishment  including  registry  personnel  whose  role  is 
dedicated  to  information  capture.  This  principle  should 
also  apply  to  data  capture  for  research  projects  in  the 
deployed  environment. 

Study  Limitations 

Data  collection  for  this  study  was  conducted  by  two 
British  military  surgical  residents  in  the  second  half  of  a 
7-week  deployment.  This  brief  operational  tour  limited 
the  sample  size  that  could  be  collected,  but  it  was  con¬ 
sidered  sufficient  for  a  preliminary  investigation.  It  is 
possible  that  this  period  was  unrepresentative  of  normal 
activity  and  that  greater  casualty  flow  might  have  af¬ 
fected  the  accuracy  of  physician  coding  by  reducing  the 
time  available  to  match  injuries  to  detailed  descriptions 
in  the  AIS(M)  dictionary.  However,  the  experience  of  the 


senior  authors  from  multiple,  longer  deployments  sug¬ 
gests  that  the  activity  level  was  not  unusual. 

A  further  limitation  is  that  our  selection  criteria  focused 
on  patients  likely  to  have  more  severe  injuries.  This 
could  lead  to  greater  apparent  disparity  between  physi¬ 
cians  and  registry  staff,  as  external  injuries  are  predomi¬ 
nantly  minor  and  would  be  easily  recognized  as  such  by 
both  groups.  However,  these  more  complex  injuries  are 
precisely  those  in  which  detail  may  be  lost  in  the  written 
record  and,  thus,  are  most  relevant  to  our  hypothesis. 

This  study  did  not  verify  the  accuracy  of  the  surgeons’ 
assessment  of  injuries,  and  none  of  our  investigators  had 
been  formally  certified  in  AIS  methodology.  However, 
the  UK  surgical  training  syllabus  requires  competence  in 
the  use  of  trauma  scoring  systems,30  and  previous  work 
has  found  no  significant  difference  in  performance  be¬ 
tween  trained  and  untrained  raters.18 

This  study  does  not  identify  whether  the  difference 
in  assigned  scores  arises  from  variability  between  in¬ 
dividual  raters,  from  documentation  that  does  not 
convey  a  sufficient  level  of  detail  to  permit  accurate 
retrospective  coding,  or  from  a  different  interpretation 
of  injuries  arising  from  the  different  professional  back¬ 
grounds  of  physicians  and  registry  personnel.  Because 
our  study  was  intended  to  compare  the  perceptions  of 
the  treating  clinicians  with  registry  output,  it  was  not 
possible  to  have  the  same  clinicians  score  each  case. 
Logistical  constraints  meant  the  sample  size  was  in¬ 
sufficient  to  perform  multilevel  model-based  analysis 
of  cross-classified  data31  (not  all  cases  assessed  by  all 
reviewers)  that  could  account  for  this.  This  should  be 
studied  further  to  refine  registry  process  and  maximize 
data  quality  and  analysis. 

Conclusion 

This  study  highlights  discrepancies  in  injury  severity 
scoring  between  surgeons  and  TNCs,  with  surgeons 
recording  more  torso,  limb,  and  external  injuries,  and 
TNCs  including  more  injuries  to  the  head  and  face.  Be¬ 
cause  injury  scoring  is  vital  to  the  analysis  of  the  efficacy 
of  interventions  at  all  stages  of  the  trauma  care  pathway, 
from  point  of  wounding  to  discharge  from  rehabilita¬ 
tion,  it  is  important  to  minimize  such  differences.  Both 
TNCs  and  treating  clinicians  should  be  actively  involved 
in  the  injury  scoring  of  patients.  Collaboration  will  im¬ 
prove  both  accuracy  and  reliability. 
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